CDS
Accession Number | TCMCG027C19890 |
gbkey | CDS |
Protein Id | XP_035548000.1 |
Location | complement(join(42971186..42971254,42971868..42971984,42972143..42972303,42976582..42976717,42977522..42977724,42977940..42978096,42978186..42978360,42978729..42978917,42979023..42979121,42980741..42980863,42981197..42981328,42988169..42988257,42989411..42989468,42989564..42989799,42999094..42999192,42999407..42999574,42999653..42999745,43000236..43000394,43001895..43002020,43002148..43002241,43002783..43002920,43003838..43003907,43004500..43004867,43005819..43005898)) |
Gene | LOC109001958 |
GeneID | 109001958 |
Organism | Juglans regia |
Protein
Length | 1112aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA350852 |
db_source | XM_035692107.1 |
Definition | nuclear pore complex protein NUP107 isoform X2 [Juglans regia] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGACGTTCGCATGGACACTTCTCCCAGCTATTTCGACCCTGAAGACCTCACCATCAGGGAGCCATTCCGCCGATATGGGAAAAGGCACTCAGCATCAAGCTCATCATCCTATCATGAACACTCAACTTCTAAGTACAGTGGATCCAGGCTATTGTACGATGGACATAGCATCCATAGCCCAACTAATGCTGCCTTACTTCTTGAAAATATCAAACAAGAGGTGGAGAGCATTGACGCTGATCGCTCGGAAGGAACACCTGCAAGGACACTATCTGCTTCTAAAAGGAGACCGCCCATTGATAGTCATGGAATGGCTGACATGGATGCTGGTGCTGATTCAGTTCGCTATTTGCTGAAAGCTTGCAAGCATGAGGATGATTCATTGGCAGATGGTGGAGACACCACTTTTACCATATTTGCATCGCTACTTGATTCCGCATTTCAAGGTTTGATGTCTATTCCTGATATCATACTAAGATTTGAGGGATCTTGCCGAAATGTTTCAGAGTCAATTAGGTATGGGTCCAATGTAAGGCATGGGGCCGTAGAGGAGAAATTGATGAGGCAGAAGGCTCAGCTCCTGCTTGATGAGGCTGCTTCCTGGTCTCTTTTGTGGTACCTTTATGGGAAAGGAAATAAATCTTTAATCTCTTCAACTCTGATTTTAAAGATGAAATATTTGGTTTTAGTGCATGATGATTTGGTTCTTCTTGCAGTGTCTGAAGAACTCCCGAAAGAGCTAATCCTGTACCCCCAAACATCACACTTGGAGGCCTGCCAGTTTGTTGCAGATGATCATACAGCACAATTGTGCCTCCGGATTGTTCAATGGCTTGAGGGTTTAGCCTCCAAAGCACTTGATTTGGAAAGCAAGGTGCGAGGATCTCATGTTGGTACCTATCTACCCAGCTCTGGTGTTTGGCATCATACTCAACGGTATCTTAAGAAAGGAGCCTCCAATAAAAATACCATTAATCACTTGGATTTTGATGCTCCAACACGTGAACATGCTCATCAGCTACCTGATGACAAAAAACAAGATGAATCTCTTTTGGAAGATGTCTGGACTCTTCTAAGGGCTGGAAGACTGGAAGAAGCATGTGATCTTTGCCGGTCAGCCGGACAGCCATGGAGAGCTGCAACATTGTGCCCATTTGGGGGGTTGGACCTTTTCCCTTCTGTTGAAGCCGTGGTGAATAATGGAAAGAGTAGGATTTTGCAAGCCATTGAACTGGAAAGTGGCCCAGGACACCAATGGCGTCTTTGGAAATGGGCTTCCTACTGTGCATCTGAGAAAATTGTGGAGCAAGATGGTGGTAAATTTGAAGCAGCTGTATATGCAGCACAGTGTGGCAATTTAAAGCGCATGCTTCCAATCTGCACAGACTGGGAGTCTGCATGCTGGGCAATGGCAAAATCATGTCTAGATGTTCAGGTGGATTTGGAACTAGCTCATCTACAACCCGGGAGAGTGGATCAGATGAGCCGTTTTGGAGATGTAGCTGATGGAAGTCCTGGAAATCTCAATGGTGGTCATCAGCCTCCTGTTGGGTCGGAAAGTTGGCCAGTCCAAGTTCTGAACCAGCAGCCACGACAACTTTCTGCTCTTCTTCAGAAGCTTCATTCAACGGAAACAGTGCACGAAAGTGTTGCTCGTGGATGCAAAGAGCAGCAACGTCAAATTGAGATGATTATGATGTTAGGGGATGTACCACGCCTGCTGGACCTTATATGGTCGTGGATAGCACCTTCAGAAGATGATCAAAATGTATTCAGGCCTCATGGAGATCCCCAAATGATTCGATTTGGTGCACACCTAGTGCTTGTACTTAGACATTTACTTGCTGACGAAATGAAGGATGCTTTCAGAGAAAAAATAATGAATGTTGGTGATCTCATTCTACACATGTATGCAATGTTTCTTTTTTCAAAGCAGCATGAGGAGTTAGTGGGCATATATGCTTCTCAGCTTGCACATCACCGTTGCATTGACCTCTTTGTGCATATGATGGAACTGAGGCTGAACAGCGGGGTGCATGTCAAATATAAGATCTTCCTTTCTGCCATGGAGTACTTACCATTTTCTCCTGGGGACGATTCAAAAGGCAGTTTTGAGGAAATTATTGAGAGGGTATTGTCAAGATCTCGGGAGATCAAAGTTGGTAAATATGATAAGTTATCAGATGTACTAGAACAACACCGGCAGCAGGGTCTTCAAAAAGCTATGGTTATCCAATGGCTTTGCTTTACACCCCCATCCACAATTGCCGATGTTAAGGATGTTAGCACAAAACTTCTTTTGCGAGCCCTAATACATAGCAATATATTATTCAGGGAGTTTGCTCTGATTTCCATGTGGAGAGTGCCAGCAGTGCCAATTGGTGCGCACACATTACTCAGTCTTCTTGCTGAACCTTTGAAGCTGCTTTCAGATAATCATGATATGTTGGAGGATTATGATATTTCTGATAATTTGAGAGAGTTCCAAGACTGGAGTGAGTATTACTCTTGTGACGCAACATATCGGAATTGGCTCAAAATTGAATTGGAGAATGCAGAGGTTTCACCCCTAGAACTCTCACAGGAGGAAAAACAAAGAGCCATTTTAGCGGCCAAAGAGACACTGAATGCATCTCTCTCACTTCTACTAAGAAAAGGAAATCCTTGGTTGGCCGCAACTGAAGATCATATATATAAATCAGAAGAGCCTTTATTTCTAGAATTGCATGCCACAGCAATGCTCTGCCTTCCGACTGGTGAATGTATGTCTCCAGATGCCACAATGTGCACTACTTTAACAAGTGCCCTTTACTCTTCAGTGAGCGAGGAAGTTGTGTTAAATCGACAACTAATGGTGAATGTCTCCATATCAACGAGGGACAATTACTGCATTGAGGTTGTACTGCGATGCTTGGCAGTAGCAGGTGATGGAATTGGGCCTCAGGAAGATGGTGGCATACTCAGTACTGTTTTGGCTGCTGGCTTCAAAGGTGAGCTTCTTCGATTTCAAGCTGGAGTTACGATGGAGATCTCACGATTAGATGCTTGGTATTCAAGCAGAGATGGTTCTTTAGAAGGCCCAGCAACATACATTGTACGGAGTCTTTGTCGTAGGTGCTGTCTTCCAGAAGTCATTCTTCGATGCATGCAAGTATCAGTTTCGCTCATGGAGTCGGGTGATCCACCTGAAAGCCATGATGATTTGATTGAACTAGTTGCGTGCCCTGATAGTGGATTTCTTCATTTGTTTAGCCAACAACAATTGCAGGAATTCTTGCTGTTTGAGAGGGAATACTCAATATTCAAAATGGAGCTTCAGGAGGAACTTTGTTCTTGA |
Protein: MDVRMDTSPSYFDPEDLTIREPFRRYGKRHSASSSSSYHEHSTSKYSGSRLLYDGHSIHSPTNAALLLENIKQEVESIDADRSEGTPARTLSASKRRPPIDSHGMADMDAGADSVRYLLKACKHEDDSLADGGDTTFTIFASLLDSAFQGLMSIPDIILRFEGSCRNVSESIRYGSNVRHGAVEEKLMRQKAQLLLDEAASWSLLWYLYGKGNKSLISSTLILKMKYLVLVHDDLVLLAVSEELPKELILYPQTSHLEACQFVADDHTAQLCLRIVQWLEGLASKALDLESKVRGSHVGTYLPSSGVWHHTQRYLKKGASNKNTINHLDFDAPTREHAHQLPDDKKQDESLLEDVWTLLRAGRLEEACDLCRSAGQPWRAATLCPFGGLDLFPSVEAVVNNGKSRILQAIELESGPGHQWRLWKWASYCASEKIVEQDGGKFEAAVYAAQCGNLKRMLPICTDWESACWAMAKSCLDVQVDLELAHLQPGRVDQMSRFGDVADGSPGNLNGGHQPPVGSESWPVQVLNQQPRQLSALLQKLHSTETVHESVARGCKEQQRQIEMIMMLGDVPRLLDLIWSWIAPSEDDQNVFRPHGDPQMIRFGAHLVLVLRHLLADEMKDAFREKIMNVGDLILHMYAMFLFSKQHEELVGIYASQLAHHRCIDLFVHMMELRLNSGVHVKYKIFLSAMEYLPFSPGDDSKGSFEEIIERVLSRSREIKVGKYDKLSDVLEQHRQQGLQKAMVIQWLCFTPPSTIADVKDVSTKLLLRALIHSNILFREFALISMWRVPAVPIGAHTLLSLLAEPLKLLSDNHDMLEDYDISDNLREFQDWSEYYSCDATYRNWLKIELENAEVSPLELSQEEKQRAILAAKETLNASLSLLLRKGNPWLAATEDHIYKSEEPLFLELHATAMLCLPTGECMSPDATMCTTLTSALYSSVSEEVVLNRQLMVNVSISTRDNYCIEVVLRCLAVAGDGIGPQEDGGILSTVLAAGFKGELLRFQAGVTMEISRLDAWYSSRDGSLEGPATYIVRSLCRRCCLPEVILRCMQVSVSLMESGDPPESHDDLIELVACPDSGFLHLFSQQQLQEFLLFEREYSIFKMELQEELCS |